Crystallography may be the gold standard of protein structure determination, but obtaining the 
necessary high-quality crystals is also in some ways akin to prospecting for the precious metal. 
The tools and models developed in soft matter physics to understand colloidal assembly offer some 
insights into the problem of crystallizing proteins. This topical review describes the various analogies 
that have been made between proteins and colloids in that context. We highlight the explanatory 
power of patchy particle models, but also the challenges of providing guidance for crystallizing 
specific proteins. We conclude with a presentation of possible future research directions. This article 
is intended for soft matter scientists interested in protein crystallization as a self-assembly problem, 
and as an introduction to the pertinent physics literature for protein scientists more generally. 


I. INTRODUCTION 

Biological macromolecules are central to life processes. 
Although some of these processes can be characterized 
at a relatively coarse scale, more often than not a 
microscopic understanding of the structure and dynamics 
of the involved biomolecules is also essential [1]. 
Proteins, for instance, interact with each other and their 
environment through fine-tuned structural features so as 
to perform their biological functions; reciprocally, 
protein malfunction is often due to structural defects, 
and may result in disease [7]. Proteins also represent 
some of the most sophisticated nano-machines known, 
having been shaped by natural selection. From an 
engineering perspective, few other systems offer more insights 
into designing devices on that size scale [5]. Our limited 
knowledge of protein structures thus limits our 
comprehension of biological phenomena, our ability to discover 
new drugs, and our design of bio-inspired materials. 
For that reason, sizable research efforts are being 
expended on extracting detailed protein structures and 
dynamics. 

As a field, structural biology mostly relies on protein 
structures obtained from diffraction-based methods. 
Since X-ray crystallography enabled the structure 
determination of myoglobin and hemoglobin in the 1950s, both 
the amount and quality of structural information have 
greatly increased. Protein crystallography, however, 
requires protein crystals, and obtaining them is often the 
limiting experimental step. Expertise from many 
different fields - from computational biology and robotics 
to surfactant science - has thus been brought to bear 
on the problem [5]. This article specifically reviews the 
contribution of soft matter physics to understanding the 
crystallization of globular proteins [11]. Although the 
scientific conversation between the soft matter and 
structural biology communities has thus far mostly focused 
on providing a physical rationale for experimental 
observations, recent conceptual advances and an increased 
back and forth between theory and experiment suggest 
that richer exchanges may soon become the norm. In 
the following, we recapitulate how this development has 
come to be. We present in Sects. II and III an overview of 
the basics of protein crystallography and protein crystal 
assembly, respectively, in order to motivate the practical 
importance of physical insights. We then discuss various 
isotropic (Sect. IV) and patchy (Sect. V) models of protein 
assembly. The final section concludes with a discussion of 
possible future research directions. 

Note that we aim here to provide a reasonably broad 
physical description of protein crystallization and its 
constraints. Our target readership consists of soft matter 
scientists with either a novel or a renewed interest in the 
problem. We thus describe some more elementary aspects 
of the underlying structural biology and crystallization 
technology. We intend, however, the later sections to 
also serve as an introduction to the soft matter literature 
for protein scientists more generally. 


II. DIFFRACTION AND PROTEIN 
CRYSTALLIZATION 

Proteins are encoded in DNA as a series of base pairs, 
which are then translated into a sequence of amino acids, 
forming the primary structure of the molecules. To 
perform its functions, a protein must also hierarchically 
fold into its secondary and tertiary structures. It is the 
properties of the resulting three-dimensional object that 
largely determine how the molecule interacts with its 
environment. This connection between function and shape 
is at the root of both structural biology and structural 
genomics. 

Advances in high-throughput sequencing have 
markedly increased the number of known protein-encoding 
genes and thus of known primary structures. Knowledge 
of the amino acid sequence alone, however, currently 
provides but limited insights into the higher-order 
structure of a protein. Predicting a protein's full tertiary 
structure from its sequence is indeed a remarkably 
complex task [18]. Because for most purposes protein 
structures cannot be inferred or calculated, they must 
be determined experimentally. The classical and most 
frequently used approaches for doing so are X-ray and 
neutron crystallography. These solid-phase diffraction 
methodologies can handle proteins containing several 
thousands of amino acids and reach sub-atomic 
resolution (< 1.5 Å). By contrast, in spite of important 
methodological advances, nuclear magnetic resonance 
(NMR) still cannot resolve the structure of solvated 
proteins containing more than two to three hundred 
amino acids, i.e., a few tens of kilodaltons [19, 20]. 
NMR-resolved structures thus represent only a small 
(although increasing) fraction of the protein data bank 
(PDB). Recent advances have also pushed single-particle 
electron cryomicroscopy near atomic resolution [21-23]. 
However, (sub-)atomic resolution may remain physically 
unattainable because of radiation damage, beam-induced 
movement and charging of the sample [23]. The classical 
diffraction methodologies are thus likely to remain the 
reference for the foreseeable future. Interestingly, new 
diffraction-based techniques are also under development. 
Electrons, which scatter fairly strongly from molecules, 
can provide a diffracted image of a protein's Coulomb 
potential [24], and the intense X-ray pulses from 
free-electron lasers (FEL) provide high-resolution structural 
information from relatively small crystals [23]. 

Whatever the chosen diffraction approach may be, 
three main steps must be performed in order to prepare a 
sample: (i) the protein must be expressed in a sufficient 
amount, which is typically achieved by using plasmids 
in cultured bacterial cells; (ii) the protein must be 
purified, in order to isolate it from the biological medium 
used to express it; and (iii) the purified protein molecules 
must be assembled in an ordered and well-packed 
crystallite, whose minimal size depends on the scattering 
intensity of the diffracted radiation (from tens of nanometers 
or less on a side for FEL to millimeters for neutron 
beams [26, 27]). Although each of these steps presents 
several experimental challenges, crystallization is by far 
the most troublesome. Under standard biological 
conditions, most proteins do not easily crystallize. 
They likely evolved to limit aggregation that 
interferes with their normal biological function [28, 29] (with a 
few notable exceptions [11, 30]). Some neurodegenerative 
disorders, such as Alzheimer's and Parkinson's diseases, 
have indeed been linked to protein solubilization failure 
resulting in unwanted aggregation [31], and cataract 
formation can directly involve the crystallization of eye-lens 
proteins [32-35]. Hence, in order to promote their 
periodic assembly, proteins must be placed in chemical 
conditions that are typically far from those encountered in 
biological systems, yet must not result in a loss of 
secondary or tertiary structure. 

Even within those strict constraints, the chemical 
space within which to locate conditions that promote 
crystallization is remarkably vast. Exhaustive searches 
are simply beyond experimental reach [36]. 
Crystallographers have thus developed chemical screens 
summarizing conditions that have been previously successful. For 
globular proteins, these are aqueous solutions containing 
various combinations of three families of co-solutes: 
inorganic salts, polymers (e.g., polyethylene glycol), and 
small organic molecules [9]. Technological advances in 
automating the experimental process currently allow 
for up to thousands of these crystallization cocktails to 
be tested at once [37], but even that number is only a 
minute fraction of the full spectrum of chemical 
possibilities. 

More crucially, existing screens are far from 
guaranteeing crystal formation. Despite accrued experience 
and improved experimental techniques, successful 
crystallization indeed remains the exception rather than the 
rule. On average, only 0.04% of crystallization 
experiments generate good-quality crystals, which makes them 
quite time consuming and expensive to obtain. 
Membrane proteins are even harder to crystallize [38]. Yet as 
long as diffraction-based methods remain the most 
desirable way to determine protein structures, a better 
understanding of the physical mechanisms underlying protein 
crystal assembly is the only possible path towards a 
high-throughput scheme comparable to next-generation DNA 
sequencing [12, 39]. Brute-force approaches are unlikely 
to succeed on their own. 
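
To give a sense of scale for the 0.04% average success rate quoted above, the 
following Python sketch estimates how likely a screen of a given size is to 
produce at least one good-quality crystal. It assumes, purely for illustration, 
that experiments are independent and share the same success probability, which 
real screens do not satisfy; the function names and the 1536-condition screen 
size are hypothetical choices, not values taken from the literature reviewed here. 

```python
import math


def prob_at_least_one_hit(n_trials: int, p_success: float = 0.0004) -> float:
    """Probability of at least one success in n_trials independent experiments.

    p_success defaults to the 0.04% average quoted in the text; treating all
    experiments as independent and identical is an illustrative assumption.
    """
    return 1.0 - (1.0 - p_success) ** n_trials


def trials_for_confidence(target: float, p_success: float = 0.0004) -> int:
    """Smallest number of trials whose hit probability reaches `target`."""
    return math.ceil(math.log(1.0 - target) / math.log(1.0 - p_success))


if __name__ == "__main__":
    # Hypothetical screen size, chosen only as an example.
    print(prob_at_least_one_hit(1536))
    print(trials_for_confidence(0.95))
```

Under these assumptions, several thousand experiments are needed before a 
single hit becomes likely, which is consistent with the claim that brute 
force alone scales poorly. 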


III. SOLUBILITY PHASE DIAGRAMS AND 
PROTEIN CRYSTALLIZATION 

From a thermodynamic viewpoint, protein 
crystallization is a standard phase transition at which the chemical 
potential of protein molecules dispersed in an aqueous 
solvent is equal to that of those ordered in a crystal. The 
transition is in some ways similar to water freezing into 
ice, suggesting that liquid-state descriptions may 
provide useful microscopic insights. Understanding protein 
crystallization, or any other phase transition, should 
indeed follow from knowing the (effective) interactions 
between protein molecules [2]. In other words, a 
detailed description of protein-protein interactions should 
also provide the protein phase behavior (Fig. 1) [13]. Yet, 
although the underlying forces through which proteins 
interact, i.e., hydrogen bonding, van der Waals, 
hydrophobic, and electrostatic interactions, are individually fairly 
well characterized, their collective contribution to 
protein assembly is much less well understood [49, 50]. A 
way to coarsen microscopic details into a 
simpler, effective description of protein pair interactions is 
thus needed. 

Phase diagrams capture graphically the regions of 
parameter space over which different phases of 
matter are thermodynamically stable. For mixtures, the 
phase diagram is multi-dimensional, but in some 
systems two-dimensional projections on the 
temperature-concentration plane of a key component, e.g., the protein, 
recover a significant fraction of that information. For 
crystallization cocktails, however, this projection 
typically involves rescaling the temperature axis by an 
aggregate function of the solution conditions. Co-solutes 
can indeed change the energy scale over which protein 
molecules interact with one another. For many 
proteins, this projection displays phases that are analogous 
to those of a single-component system, for which the 
two-dimensional projection is complete. Crystal, 
liquid, vapor, and supercritical fluid regimes are indeed 
identifiable. The vapor phase corresponds to a low 
concentration of proteins in solution and the liquid to 
a high concentration. The two phases are separated by a 
first-order transition that terminates at a second-order 
critical point, above which a supercritical fluid is 
observed. Because protein solutions are mixtures of 
proteins, water, and other additives, however, the 
properties of phases in a protein system may significantly 
differ from what is typically observed in simple fluids. A 
protein crystal, for instance, contains on average more 
than 40% water and may embed co-solutes in 
fractions that differ from what is left in the crystallization 
cocktail. 

Phase diagrams for about a dozen proteins, including 
lysozyme, γ-crystallin, insulin, and myoglobin, have been 
experimentally determined under different solution 
conditions [52-55]. Intriguingly, these phase diagrams have 
in common a topology that is qualitatively different from 
that of simple liquids (Fig. 2) [53]. In simple liquids, 
the first-order phase transition between a vapor and a 
liquid phase terminates at a stable second-order critical 
point that lies above the crystal solubility curve, whereas 
in proteins the critical point is typically situated below 
the solubility line. The protein gas-liquid binodal and 
critical point are, therefore, metastable with respect to 
gas-crystal coexistence (Fig. 2). 

It is also empirically found that successful 
crystallization is most common when a protein solution 
drop is prepared in the supersaturated region between 
the solubility line and the metastable critical point - a 
region that is sometimes called the nucleation zone or 
the crystallization gap (Fig. 2). If it is 
prepared at conditions below the critical point, the system 
has a propensity to aggregate in a disordered, 
percolating network, i.e., a gel; the gelation probability increases 
with quench depth [53]. Such aggregation, although 
thermodynamically metastable with respect to crystal 
assembly, forms much more rapidly and is often long-lived. 
It thus reduces the likelihood of successful protein 
crystallization. 

Based on this description, given a protein phase 
diagram, protein crystallization should be easily achieved 
(Fig. 1). Unfortunately, experimental determination of 
protein phase diagrams (considering the vast number of 
possible co-solutes) is a lot more time- and 
resource-consuming than even the most ambitious crystallization 
screen. Hence, although physically elegant, a direct 
application of thermodynamic principles is of limited 
practical relevance. This picture is also overly reductive 
because a protein's structure may also respond to 
changing solution conditions. In order to gain useful insights 
about protein crystal assembly, one must therefore 
extract the key features of their phase diagrams from a 
limited amount of information. Because atomistic models of 
crystal assembly would be computationally intractable 
(simulating a single protein can be quite demanding, 
let alone hundreds of them), coarse-grained models are 
the only viable option to fill in the missing information 
(Fig. 1). Conveniently, self-assembly has been heavily 
studied in related models over the last couple of decades. 

FIG. 1. Schematic description of the vicious circle of protein 
crystallization. Crystals are needed to obtain protein 
structures through diffraction (green arrow), but these 
crystals would be more easily obtained if experimental phase 
diagrams were known (black arrow). These phase diagrams 
are, however, expensive to determine. Phase diagrams of 
coarse-grained protein models may be accessible via 
simulations (blue arrow), but an appropriate protein model should 
stem from a detailed understanding of protein-protein 
interactions (red arrow), the computation of which requires some 
knowledge of the protein structure (purple arrow). By 
contextualizing the observed protein behavior, however, soft matter 
models could provide guidelines to refine and iterate the 
process. 

IV. ISOTROPIC MODELS OF PROTEIN PHASE 
BEHAVIOR 

The field of soft matter physics has grown from a 
desire to understand the physics behind the assembly of 
squishy, mesoscale objects, such as polymers, liquid 
crystals, grains, and cells. From the self-assembly of these 
objects emerges a complex yet often universal array of 
material behaviors, including glass formation and 
jamming, ordered microphases, and tissue growth. Soft 
materials are also particularly appealing because a large part 
of their complexity can be captured by purely classical 
models, and because the relevant range of these models' 
parameters is much wider than for atomic-scale simple 
liquids. 

The study of the rich and robust phenomenology of 
colloidal suspensions, which in some ways epitomizes the 
field of soft matter, opens the door to understanding 
protein assembly. In the mid-1980s, suspensions of purely 
repulsive colloidal particles were first observed to crystallize 
similarly to classical hard spheres. Adding a 
depletant, i.e., a soluble and inert co-solute much smaller than 
the colloids, was understood to result in a net pair 
attraction between particles, and, based on the law of 
corresponding states [33], was expected to result in the 
presence of a gas-liquid coexistence region. Experiments 
showed instead that adding a depletant unavoidably 
results in gel formation. 

A possible resolution to this discrepancy emerged from 
the work of Lekkerkerker and Frenkel [69, 70], who found 
that decreasing the attraction range depresses the 
gas-liquid coexistence region more than it lowers the crystal 
solubility curve (Fig. 2B). For particles with a 
square-well attraction range λσ < 1.25σ, relative to particles of 
diameter σ, the critical point thus falls below the 
solubility curve. Different models with short-range attraction 
confirmed the qualitative robustness of this result 
[71-75], and its universality was synthesized in an extended 
law of corresponding states [76-81]. 
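
The role of the attraction range can be made concrete with a minimal Python 
sketch of the square-well model. It uses the standard closed-form second 
virial coefficient of a square-well fluid, reduced by its hard-sphere value, 
B2* = 1 - (λ³ - 1)(exp(ε/kT) - 1), which is one common way of comparing 
short-range attractive systems. The function names are hypothetical, and the 
reference value B2* = -1.5 used below is an often-quoted rule of thumb for 
the vicinity of the gas-liquid critical point, taken here as an assumption 
rather than as a result of this review. 

```python
import math


def square_well(r: float, sigma: float = 1.0, lam: float = 1.25,
                eps: float = 1.0) -> float:
    """Square-well pair potential: hard core of diameter sigma,
    attractive well of depth eps extending out to lam * sigma."""
    if r < sigma:
        return math.inf
    if r < lam * sigma:
        return -eps
    return 0.0


def reduced_b2(lam: float, t_star: float) -> float:
    """Second virial coefficient divided by its hard-sphere value,
    at reduced temperature t_star = kT / eps."""
    return 1.0 - (lam ** 3 - 1.0) * math.expm1(1.0 / t_star)


def t_star_at_b2(lam: float, b2_star: float) -> float:
    """Reduced temperature at which the reduced B2 equals b2_star
    (requires b2_star < 1, i.e., net attraction)."""
    return 1.0 / math.log1p((1.0 - b2_star) / (lam ** 3 - 1.0))


if __name__ == "__main__":
    # Assumed rule-of-thumb value of B2* near the critical point.
    for lam in (1.50, 1.25, 1.10):
        print(lam, t_star_at_b2(lam, -1.5))
```

In this sketch, shortening the well at fixed reduced B2 lowers the 
corresponding reduced temperature, in line with the depression of the 
gas-liquid coexistence region described above; it says nothing about the 
position of the solubility curve, which requires a free-energy calculation 
for the crystal. 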

The change in behavior in going from long- to 
short-range attraction can be intuitively explained by 
considering the energy-entropy balance in the liquid and 
crystal phases. For long-range attraction, there 
exists a relatively broad concentration-temperature range 
within which liquid particles are close enough to 
benefit from each other's attraction while maintaining the 
high entropy characteristic of disordered configurations 
(Fig. 2B). The liquid free energy can therefore be lower 
than that of the crystal, despite the crystal having a lower 
potential energy. When the attraction range is reduced, 
however, particles in the liquid have to be much closer to 
each other in order to feel their attraction (Fig. 2B). This 
constraint drastically reduces the number of low-energy 
configurations, and thus the liquid entropy. By contrast, 
for interaction ranges as low as λσ = 1.10σ, which is 
comparable to the interparticle distance in a hard-sphere 
crystal near melting [59], the crystal entropy remains 
almost unchanged. As a result, the liquid free energy rises 
above that of the crystal and the liquid phase ceases to 
be thermodynamically stable (Fig. 2B). 

The gas-liquid coexistence, however, remains 
metastable; theoretical and experimental work has 
shown that its dynamical influence does not disappear. 
Homogeneous systems that are supersaturated near the 
metastable critical point indeed experience critical 
fluctuations that lower the barrier to crystal nucleation [82]. 
In addition, if quenched below the critical point, these 
systems first undergo a spontaneous phase separation 
that is well described by spinodal decomposition. It 
is this process that gets arrested by the sluggish 
particle dynamics in the dense phase, and results in the 
emergence of a percolating gel network. 

This phase behavior is stunningly reminiscent of that 
of many proteins, as described in Sect. III. The 
analogy between short-range attractive colloids and proteins, 
which caught the attention of a number of soft 
matter scientists in the mid-1990s [53, 82], is further 
supported by microscopic evidence. First, many 
globular proteins are roughly spherical objects with a fairly 
short-range attraction. Their diameter is at least a few 
nanometers, while the mechanisms that result in pair 
attraction once electrostatic repulsion is screened, e.g., 
hydrogen bonding, salt bridges, and hydrophobicity, 
extend only over a few angstroms. These attractive forces 
are thus felt within about 10% of the protein 
diameter, which falls well within the specification for 
short-range attraction. Second, George and Wilson observed 
that the optimal solution conditions for protein 
crystallization were consistent with being in proximity of a 
metastable critical point. Third, contemporary 
studies of the PDB did not identify any statistical 
signatures of a preferred relative orientation of protein chains 
in crystals, which was consistent with protein-protein 
attraction being roughly orientation-independent, i.e., 
isotropic. This last suggestion, in particular, 
would prove to be overly simplistic, as we discuss in 
Sect. V. A productive effort nonetheless ensued to 
characterize short-range isotropic models, to rationalize from 
that perspective the phase diagram [73] and solution 
behavior of specific proteins, and to make generic 
arguments about heterogeneous nucleation [100, 101]. 


V. PATCHY MODELS OF PHASE BEHAVIOR 

Though appealing in their simplicity, colloidal 
descriptions based on isotropic interactions between proteins 
miss some of the key phenomenology of globular 
proteins. First, isotropic interactions 
systematically result in densely packed crystals, such as 
the face-centered-cubic (FCC) or body-centered-cubic 
(BCC) lattices (Fig. 3), while crystals of biomolecules are 
fairly empty. Their packing fraction η, i.e., the fraction 
of space physically occupied by proteins, is between 0.3 
and 0.5 (compared to 0.74 and 0.68 for FCC and BCC, 
respectively), even though the protein chains themselves 
are essentially close packed. Second, the width 
of the region identified by the metastable vapor-liquid 
line in experimental protein phase diagrams does not 
match the expectations for particles with isotropic 
interactions [104, 106, 107]. Third, point mutations that 
change a single amino acid residue at the protein surface 
can dramatically alter the topology of a protein phase 
diagram [53, 108], while if attraction were truly isotropic 
such a small perturbation would be expected to have but 
a fairly limited impact. 
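
The reference values of 0.74 and 0.68 quoted above follow from elementary 
geometry, as the short Python sketch below recalls for touching equal 
spheres. The helper names are hypothetical; the same generic function could 
in principle be applied to a protein crystal given the molecular volume and 
unit cell volume, which are not provided here. 

```python
import math


def sphere_volume(radius: float) -> float:
    """Volume of a sphere of the given radius."""
    return 4.0 / 3.0 * math.pi * radius ** 3


def packing_fraction(n_per_cell: int, particle_volume: float,
                     cell_volume: float) -> float:
    """Fraction of the unit cell volume occupied by the particles."""
    return n_per_cell * particle_volume / cell_volume


def fcc_packing_fraction(radius: float = 1.0) -> float:
    """FCC: 4 spheres per cubic cell, touching along the face diagonal,
    so the cell edge is a = 2 * sqrt(2) * R."""
    a = 2.0 * math.sqrt(2.0) * radius
    return packing_fraction(4, sphere_volume(radius), a ** 3)


def bcc_packing_fraction(radius: float = 1.0) -> float:
    """BCC: 2 spheres per cubic cell, touching along the body diagonal,
    so the cell edge is a = 4 * R / sqrt(3)."""
    a = 4.0 * radius / math.sqrt(3.0)
    return packing_fraction(2, sphere_volume(radius), a ** 3)


if __name__ == "__main__":
    print(round(fcc_packing_fraction(), 2), round(bcc_packing_fraction(), 2))
```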

From a (bio)chemical viewpoint, these observations are 
far from surprising. Protein surfaces are heterogeneous 
and complex. There is thus no reason to believe that 
different relative protein orientations should result in 
similar interactions. Unlike depletion attractions, hydrogen 
bonding and salt bridges are strongly directional and 
localized; they do not evenly cover the whole protein 
surface. Even the less chemically specific interactions based 
on hydrophobicity rely on the existence of a particularly 
good structural match (recognition) between two parts 
of a protein surface, and thus exhibit a strong 
orientation dependence (see Appendix for a more detailed 
discussion of specificity). We also now better appreciate, 
thanks to more careful statistical analysis of the PDB, 
that crystal contacts are quite different from randomly 
chosen elements of protein surfaces. They are enriched 
in glycine and small hydrophobic residues, and depleted 
in large polar residues with high side-chain entropy, such 
as lysine and glutamic acid. A large fraction of 
glycine and alanine on a protein surface further 
correlates with a higher probability of the molecule having 
been crystallized. 

FIG. 2. A: Typical topology of a temperature T-protein concentration phase diagram, where the effective interaction U can 
be tuned by changing solution conditions, e.g., changing salt concentration and pH, adding depletants, etc. At low saturation 
(yellow in the color bar), the drive to crystallize is insufficient to generate crystals within reasonable experimental time. At 
high saturation (red in the color bar), the drive to assemble is so strong that molecules form amorphous aggregates. In between 
lies the nucleation zone (or crystallization gap), where crystal assembly is possible (green in the color bar). By tuning T or U 
experimentally, the vertical axis can be rescaled and the saturation level controlled. B: The radial range of attraction, λσ, of 
the pair interaction potential, U(r), affects the topology of the scaled temperature-packing fraction, T/ε-η, phase diagram. The 
situation is here illustrated for square-well fluids, but is fairly independent of the precise form of U(r). In simple liquids, the 
relative interaction range is sufficiently large, λ > 1.25, to make the liquid state entropically stable (top half). In colloids and 
proteins with short-range attraction, i.e., for λ < 1.25, the reduced liquid entropy lowers the critical point below the solubility 
line (bottom half). 

A. Simple Patches 

This evidence drove the realization that directional 
aspects of protein pair interactions ought to be taken 
qualitatively into account, in order to obtain reasonable 
minimal models of proteins (Fig. 3). Bonding directionality 
makes the solution behavior of proteins more akin to that 
of associative liquids, such as water, than to that of 
simple (isotropic) liquids, which is a fundamental distinction 
in liquid-state theory [59]. Nearly coincident with this 
realization came the promise of synthetic colloids with an 
increasing degree of sophistication, such as faceted and 
DNA-coated particles, which led to a boom of interest 
in models with directional interactions, as was recently 
reviewed by Bianchi et al. Our aim here is not to 
broadly go over this advance, but to highlight the aspects 
most important to understanding protein crystallization. 

A broad array of schematic models with short-range 
anisotropic, directional attraction, i.e., patchy particle 
models, have since been studied (Fig. 3). Although the 
details of patch geometry and interaction 
parameterization may differ, these patchy models generally have phase 
diagrams [104, 107, 111-113], fluid properties [114], and 
assembly pathways [115-117] that are significantly 
different from those of short-range isotropic models. Most 
saliently for protein assembly, patchiness further lowers 
the metastable critical point, stabilizes open geometries 
such as diamond and cubic crystals, and qualitatively 
changes the shape of the gas-liquid coexistence region 
(Table I) [107, 118]. This last point is interesting 
because experimental phase diagrams of proteins, such as 
lysozyme and γ-crystallin, have a binodal whose width 
and critical packing fraction are much smaller than those 
of isotropic models [53, 104]. Although patchiness 
requires the specification of more model parameters and 
results in a certain loss of universality [107, 119], it also 
weakens high-order correlations in the fluid structure, 
which enables the use of relatively simple liquid-state 
descriptions, such as Wertheim's theory, for their 
analysis. 

FIG. 3. Increasing the complexity of schematic models is necessary to accommodate the high interaction specificity that controls 
protein assembly. Isotropic interactions, which are well-suited to describe simple liquids or depletion interaction in colloids, 
exclusively depend on the inter-particle distance and assemble in close-packed crystals (left). Thermodynamic stability of 
low-density crystals requires directionality, which is introduced in patchy models by requiring the alignment of the surface patches 
for two particles to interact (middle, red patches have to face one another to interact). However, the highly heterogeneous 
protein surface requires that patches only interact with a single partner patch, and not promiscuously (right, only patches of 
the same color interact). 
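
The alignment requirement described for patchy models can be sketched in a 
few lines of Python. The example below uses a Kern-Frenkel-type potential, 
i.e., a square well that is switched on only when a patch on each particle 
points toward the other within a cone of half-angle δ. This specific 
functional form is one common choice in the patchy-particle literature and 
is adopted here as an assumption; the function names and default parameter 
values are hypothetical. 

```python
import math
from typing import Sequence

Vec = Sequence[float]


def _dot(a: Vec, b: Vec) -> float:
    return sum(x * y for x, y in zip(a, b))


def patch_faces(direction: Vec, patches: Sequence[Vec],
                cos_delta: float) -> bool:
    """True if at least one unit patch vector lies within the cone of
    half-angle delta around the unit vector `direction`."""
    return any(_dot(p, direction) >= cos_delta for p in patches)


def patchy_pair_energy(r_ij: Vec, patches_i: Sequence[Vec],
                       patches_j: Sequence[Vec], sigma: float = 1.0,
                       lam: float = 1.1, eps: float = 1.0,
                       cos_delta: float = 0.9) -> float:
    """Kern-Frenkel-type pair energy (illustrative assumption).

    r_ij points from particle i to particle j; patches are unit vectors
    expressed in the laboratory frame.
    """
    r = math.sqrt(_dot(r_ij, r_ij))
    if r < sigma:            # hard-core overlap
        return math.inf
    if r >= lam * sigma:     # beyond the attraction range
        return 0.0
    u = [x / r for x in r_ij]
    minus_u = [-x for x in u]
    # Both particles must point a patch at each other to bond.
    if patch_faces(u, patches_i, cos_delta) and \
            patch_faces(minus_u, patches_j, cos_delta):
        return -eps
    return 0.0
```

Any patch of one particle can bond with any patch of the other in this 
sketch, which corresponds to the middle panel of Fig. 3 rather than to the 
specific interactions of the right panel. 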


Because most of the theoretical studies of patchy 
particles were motivated by the promises of colloidal 
synthesis, they often focused on particles with a 
couple of patches [118, 128-130], or on varying the 
surface coverage of a single patch, i.e., Janus particles 
[131]. For particles with one to three patches, small 
perturbations to the number of patches indeed 
dramatically affect the liquid behavior, notably enabling the 
stabilization of empty liquids - liquid states with a 
vanishing density [118, 134]. If this regime is of interest 
for protein crystal assembly, it is as one to be avoided. 
Monomeric protein crystals display at least six patches, 
which is a minimum for mechanical stability [95, 135]. The 
lack of evolutionary pressure for proteins to crystallize 
indeed results in their crystals having most commonly 
the lowest-symmetry chiral (protein chains are chiral) 
space group compatible with that number of contacts, 
i.e., P2₁2₁2₁ [136]. Crystals of oligomeric proteins, for 
which evolution has shaped at least the oligomeric 
contacts, often assemble in complex asymmetric unit cells 
with more than ten distinct patches [137, 138]. The 
competition between these interactions can favor the 
formation of small metastable crystallites, hindering the 
assembly of defectless crystals. Patchy models with a larger 
number of patches have, however, received limited 
attention thus far [139]. 


B. Specific Patches 


In addition to being directional, protein-protein 
interactions are typically specific; there is a one-to-one match 
between pairs of crystal contacts (Fig. 3). Because 
pairwise attraction between proteins depends sensitively on 
molecular details, a given patch often interacts 
with a single other patch, to the exclusion of all 
others. 


Interaction specificity has a sizable effect on protein 
assembly. It obviously affects the liquid entropy and thus 
the position of the metastable critical point relative to 
the solubility curve [125, 140-143], but more importantly 
it results in interaction heterogeneity. Because each pair 
of interacting patches relies on different physicochemical 
mechanisms, their bonding strength varies [143]. Our 
recent study of bond energy asymmetry suggests that this 
factor may play a key role in protein crystallizability. 
Energy asymmetry alone can indeed result in gel formation 
due to percolation as in empty liquids, the closing of the 
crystallization gap, and the restabilization of the critical 
point above the crystal solubility line [143]. Future 
studies will surely expand this list. Note that although no 
colloidal particles with specific patches as complex as those 
of proteins have yet been synthesized, patchy coatings of 
complementary selective DNA strands offer a synthetic 
gateway for obtaining such surface features [127]. 
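
Specificity and bond energy heterogeneity can be represented with a simple 
lookup structure, sketched below in Python. Each patch label is assigned 
exactly one partner and a bond strength; a contact between non-partner 
patches contributes no attraction. The labels, the function names, and the 
numerical bond strengths used in the example are hypothetical and chosen 
only for illustration. 

```python
from typing import Dict, Hashable, Tuple

Bonds = Dict[Hashable, Tuple[Hashable, float]]


def build_specific_bonds(
        pairs: Dict[Tuple[Hashable, Hashable], float]) -> Bonds:
    """Map every patch label to its unique partner and bond strength.

    `pairs` maps (patch_a, patch_b) to a positive well depth. A patch may
    be its own partner (self-complementary contact). Raises ValueError if
    a patch is assigned more than one partner.
    """
    bonds: Bonds = {}
    for (a, b), eps in pairs.items():
        for label in {a, b}:
            if label in bonds:
                raise ValueError(f"patch {label!r} already has a partner")
        bonds[a] = (b, eps)
        bonds[b] = (a, eps)
    return bonds


def bond_energy(bonds: Bonds, patch_i: Hashable, patch_j: Hashable) -> float:
    """Energy of an aligned contact between two patches: minus the bond
    strength if they are partners, zero otherwise."""
    if patch_i not in bonds:
        return 0.0
    partner, eps = bonds[patch_i]
    return -eps if partner == patch_j else 0.0


if __name__ == "__main__":
    # Three contacts with unequal (illustrative) strengths.
    bonds = build_specific_bonds(
        {("A", "B"): 1.0, ("C", "D"): 0.5, ("E", "E"): 0.2})
    print(bond_energy(bonds, "A", "B"), bond_energy(bonds, "A", "C"))
```

Combined with a geometric alignment test, such a table is enough to turn a 
generic patchy model into one with specific, energetically asymmetric 
contacts. 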
TABLE I. Summary of the strengths and weaknesses of short-range attractive models when applied to protein crystallization 

Model             Strengths                       Weaknesses
----------------  ------------------------------  -----------------------------
Isotropic         Minimal number of parameters    Incorrect crystal symmetry
                  Metastable gas-liquid binodal   Incorrect binodal shape
                                                  Robust to point mutations

Patchy            Few parameters                  Unrealistic pair interactions
                  Open crystals                   Fixed geometry
                  Reasonable binodal shape

Specific patches  Correct crystal symmetry        Many parameters
                  Realistic pair interactions     Fixed geometry
                  Tunable for specific proteins


C. Measuring Patches 

The loss of generality that accompanies patchiness and
specificity leads even schematic models of proteins to require
a broad range of system-specific parameters. In
order to develop a relevant model for a given protein,
a better description of patch energetics and coverage is
thus necessary. Although the PDB provides a wealth of
structural information about crystal contacts [109, 144],
relating this information to an effective free energy of interaction
in solution is not straightforward. Thus far, the
energetics of very few crystal contacts has been characterized.

An early effort used a phenomenological model to estimate
pair interactions from a PDB structure [140], in
order to reconstruct the phase behavior of bovine chymotrypsinogen.
Existing chemical databases do not suffice,
however, to generalize this approach with much accuracy.
More recent attempts have used all-atom molecular
dynamics simulations of protein pairs in solution in
order to extract the angularly-resolved potential of mean
force [125, 145, 146]. Within the quality of the selected
molecular force fields, these simulations offer a reasonable
characterization of known crystal contacts. Some of
these studies were even able to capture the crystal assembly
behavior of various proteins. As long as sufficient
structural information about patchiness is available, crystal
patch energetics can thus be reconstructed reasonably
well (Fig.0. 
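
To illustrate what extracting a potential of mean force involves, the sketch below Boltzmann-inverts sampled pair separations into W(r) = -kT ln g(r). It is restricted to the radial coordinate; an angularly-resolved version would also bin on the relative orientations of the two proteins. The function name radial_pmf, its arguments, and the synthetic square-well data are assumptions made for this example and do not reproduce the protocol of the cited simulation studies, which may rely on enhanced sampling.

```python
import numpy as np

def radial_pmf(distances, r_edges, weights=None, kT=1.0, n_tail=5):
    """Boltzmann-invert sampled center-to-center distances into W(r).

    distances : sampled pair separations
    r_edges   : bin edges of the separation histogram
    weights   : optional statistical weight of each sample
    n_tail    : number of outermost bins used to set W -> 0 at large r
    """
    counts, _ = np.histogram(distances, bins=r_edges, weights=weights)
    shell_volume = 4.0 / 3.0 * np.pi * (r_edges[1:] ** 3 - r_edges[:-1] ** 3)
    g = counts / shell_volume            # remove the trivial r^2 Jacobian
    g = g / g[-n_tail:].mean()           # normalize to the uncorrelated tail
    with np.errstate(divide="ignore"):   # empty bins give W = +inf
        w = -kT * np.log(g)
    r_mid = 0.5 * (r_edges[1:] + r_edges[:-1])
    return r_mid, w

# Synthetic check: separations drawn uniformly in the shell 1 <= r <= 3
# (units of the particle diameter), reweighted by a square well of depth
# 2 kT and range 1.2. These numbers are illustrative only.
rng = np.random.default_rng(0)
r = (1.0 + 26.0 * rng.random(200_000)) ** (1.0 / 3.0)
boltzmann = np.where(r < 1.2, np.exp(2.0), 1.0)
r_mid, w = radial_pmf(r, np.linspace(1.0, 3.0, 21), weights=boltzmann)
```

Within statistical noise, the recovered W(r) sits near -2 kT inside the well and near zero outside it, which is the kind of consistency check one would want before trusting a parametrization on real trajectories.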

For most systems, however, this information is not
known a priori. It is thus problematic to measure the
patch characteristics of proteins that have not yet been
crystallized or that display patchy interactions that are
incompatible with the crystal structure. Because for the
vast majority of proteins no structural information is
available at all, the former is a genuine difficulty. The
latter is also important, because evidence suggests that
non-crystallographic contacts may play a kinetic role
in protein crystal assembly [147] and may even render
metastable crystal phases kinetically accessible [139].

Yet even if the structure of a protein of interest were
fully known, brute force molecular dynamics sampling of
the relative surface of a pair of proteins is well beyond
computational reach. A hierarchy of methods for identifying
candidate attractive patches without exhaustive
sampling is a more promising way forward. Approaches
that consider protein-protein “docking predictions” as
well as structural homology, followed by testing of these
suggestions by higher-precision methods, have been used
for that purpose [125], but it is still early days for the
vicious circle of protein crystallization to be broken.

VI. CONCLUSIONS AND OPEN QUESTIONS 

Soft matter has thus far provided a qualitative physical 
perspective to the problem of protein crystal assembly. In 
order to push the analogy forward, both qualitative and 
quantitative improvements are needed. 

First, as argued in Sec. V C, more and higher quality
information about the protein interactions that give rise
to patchiness must be obtained. For guiding the crystallization
of protein homologues or for optimizing solution
conditions so as to improve crystal quality, this information
could be particularly useful. From a computational
viewpoint, insights into angularly-resolved potentials of
mean force could also be gleaned from large-scale simulations.
If a sufficiently large database of interactions
were available, it might even be conceivable to use statistical
methods to parametrize the patchiness of protein-protein
interactions without first extensively simulating
the system. From an experimental viewpoint, careful
studies of weak protein-protein interactions, as have recently
been done for ubiquitin in solution, would also be
of much help [148]. Studying the structure of transient
complexes in solution [149] may not only help identify potential
crystal contacts, but also competing interactions
that can hinder crystal assembly. The design of proteins
that easily crystallize could also be used to validate and
further enrich the microscopic insights obtained from the
direct studies of protein-protein interactions [150].

Second, richer varieties of patchy models ought to be 
developed in order to address basic qualitative questions 
about protein crystal assembly. 

1. What is the role of competing patches and dimer
formation? Some proteins are observed to crystallize
in more than one crystal lattice in the same
solution composition [151-156]. The type of crystal
that assembles can then depend on the initial
protein concentration as well as on the experimental
time and temperature.

2. What is the role of internal flexibility? The
paradigm of a single well-folded protein structure is
known to be overly simplistic [157], but it remains
a key requirement for protein crystal assembly. Internal
flexibility of the protein may interfere with
crystal assembly, but the physical constraints have
not been given much attention.

3. What causes inverted solubility? Some proteins
are characterized by a decreasing protein solubility
with increasing temperature, i.e., an inverted
solubility [158-160]. Sometimes a single mutation
can flip the solubility curve [55, 108] or significantly
change the assembly kinetics [m 1161). This phenomenon
is tentatively attributed to the temperature
dependence of the water entropy [162, 163],
but remains poorly understood overall.

4. What are minimal models for membrane protein
crystal assembly? As mentioned in the introduction,
the crystallization of membrane proteins typically
involves assembly principles that are different
from those for globular proteins [164, 165]. Soft matter
insights might be helpful in building better experimental
guidance for this difficult, yet crucial [166],
problem.
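
For question 3 above, a textbook van 't Hoff argument indicates what inverted solubility requires thermodynamically. The relations below assume a dilute solution and temperature-independent enthalpy and entropy changes per protein, \Delta h and \Delta s, upon transfer from the solution to the crystal; these symbols and the reference concentration c_0 are introduced here for illustration and do not appear in the works cited above.

```latex
\begin{align}
  k_B T \ln \frac{c_s(T)}{c_0} &= \Delta h - T\,\Delta s , \\
  \frac{\mathrm{d} \ln c_s}{\mathrm{d} T} &= -\frac{\Delta h}{k_B T^2} .
\end{align}
```

Under these assumptions the solubility c_s decreases upon heating only if \Delta h > 0, in which case crystallization must be driven by an entropy gain \Delta s > 0. In a patchy description, this would correspond to an effective bond free energy that becomes more attractive as temperature increases.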

The coming years will thus likely see the emergence of 
increasingly rich patchy models that can provide a clearer 
physical understanding of these and related processes. 

In closing, it is important to note that patchy models
have also found a use in the study of proteins beyond
crystal assembly. The study of virus capsid formation,
in particular, has greatly benefited [167]. The
aggregation of proteins into amorphous structures, such
as amyloids, is also within conceptual reach of similar
approaches [Ml [Ml- The absence of clear microscopic 
information on protein-protein contacts in these systems,
however, provides an additional challenge. The study of
protein crystal assembly, which has the inherent advantage
of providing structural feedback when successful,
may thus lead the way towards a better understanding
of a broad array of protein-protein interactions.

Appendix A: Characterizing crystal contacts:
specific vs. non-specific protein-protein interactions

Protein-protein interactions have often been classified as
being either specific or non-specific. Biological specificity
is well-known to be a problematic label [170, Ch. 4], but it
is nonetheless a quite prevalent characterization, including
of protein interactions that control crystal assembly.
Its precise meaning, however, differs depending on the
disciplinary context (Table II). In this appendix, we aim
to identify these different meanings and thus provide
a brief guide to the reader of the scientific literature on
the topic.

In chemistry, specificity distinguishes certain attraction
forces from others, although the classification of
the various physical mechanisms is not unambiguous
[44, § 18.8]. In biophysics, the distinction between specific
and non-specific interactions typically relies on the
existence of an energy gap that clearly divides a single,
strong (specific) interaction from the other (non-specific)
ones [25, 171]. In molecular biology, specific
interactions are deemed responsible for the stoichiometric
recognition of a given target, while non-specific interactions
arise from the promiscuous yet non-biologically
relevant association of molecules [M) [551 117211173). Specific
interactions have thus been evolutionarily tuned to
be free-energetically strong and geometrically oriented,
while non-specific attractions have not. As mentioned
above, this general weakness, however, may itself have
evolved so as to prevent pathological aggregation [28, 29].
Although these three definitions are not necessarily orthogonal
to one another, we here specifically aim to clarify
the last one.
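
The biophysical definition can be made concrete with a toy calculation. The sketch below, which assumes a hypothetical list of interaction energies in units of kT, reports the gap between the two lowest energies and the Boltzmann weight of the strongest interaction; the function name and the two example spectra are illustrative and are not drawn from the cited references.

```python
import numpy as np

def specificity_summary(energies, kT=1.0):
    """Return (energy gap, Boltzmann weight of the strongest interaction)."""
    e = np.sort(np.asarray(energies, dtype=float))
    gap = e[1] - e[0]                      # separation of the two lowest states
    weights = np.exp(-(e - e[0]) / kT)     # shifted for numerical stability
    return gap, weights[0] / weights.sum()

# Hypothetical spectra: one gapped (a single strong contact), one ungapped.
gap_g, p_g = specificity_summary([-12.0, -2.0, -1.5, -1.0, -0.5])
gap_u, p_u = specificity_summary([-3.0, -2.5, -2.0, -1.5, -1.0])
```

With a gap of many kT nearly all the statistical weight falls on a single interaction, whereas without a gap the weight is spread over several comparable contacts.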

When applied to crystal contacts, specificity has been
used to suggest that these biologically non-functional interactions
are in many ways indistinguishable from interfaces
obtained by randomly bringing two proteins together
[28]. These interfaces do present characteristics
that are typical of non-specific interactions: they
are weaker than functional interactions (of the order of
a few kJ/mol) and they do not have any obvious biological
purpose. However, they also have unique properties
that distinguish them from randomly selected surface
patches [TIM inn). Specific local protein properties
are correlated with crystallization, and protein surface regions
carrying such properties are more likely to be responsible
for the interactions that drive crystal formation.
Hence, crystal contacts are triggered by the basic
chemical interactions present in any molecular system,
although they are typically different from the biologically
functional ones. The importance of weak protein
interactions is not limited to crystal contacts, but is also
recognized in regard to the formation of transient protein
complexes that are sometimes necessary for correct
protein function [149].

TABLE II. Summary of the properties that differentiate specific from non-specific interactions in different fields of study

Field               Specific interactions            Non-specific interactions

Chemistry           Hydrogen bonds,                  Hydrophobic, depletion,
                    salt bridges                     van der Waals, electrostatic

Biophysics          Unique, strong,                  Many and weak
                    energetically gapped

Molecular biology   Evolutionarily tuned, strong,    Weak,
                    geometrically constrained        randomly distributed


[1] This phenomenon is sometimes playfully referred to as
the revenge of structural biology.

[2] T. L. Blundell, H. Jhoti, and C. Abell, Nat. Rev. Drug. 
Discov. 1, 45 (2002). 

[3] P. Kuhn, K. Wilson, M. G. Patch, and R. C. Stevens, 
Curr. Opin. Chem. Biol. 6, 704 (2002). 

[4] T. L. Blundell and S. Patel, Curr. Opin. Pharmacol. 4, 
490 (2004). 

[5] I. Tickle, A. Sharff, M. Vinkovic, J. Yon, and H. Jhoti, 
Chem. Soc. Rev. 33, 558 (2004). 

[6] D. B. Kitchen, H. Decornez, J. R. Furr, and J. Bajorath, 
Nat. Rev. Drug. Discov. 3, 935 (2004). 

[7] M. Congreve, C. W. Murray, and T. L. Blundell, Drug. 
Discov. Today 10, 895 (2005). 

[8] N. Huebsch and D. J. Mooney, Nature 462, 426 (2009). 

[9] A. McPherson, Crystallization of Biological Macromolecules
(CSHL Press, Cold Spring Harbor, 1999).

[10] N. E. Chayen and E. Saridakis, Nat. Meth. 5, 147 
(2008). 

[11] N. E. Chayen, in Advances in Protein Chemistry and 
Structural Biology, edited by J. Andrzej (Academic 
Press, London, 2009), Vol. 77, pp. 1-22. 

[12] M. Morange, in History and Epistemology of Molecular 
Biology and Beyond, edited by H. Rheinberger and S. 
Chadarevian (Max Planck Institute for the History of 
Science, ADDRESS, 2006), Vol. 310, pp. 179-188. 

[13] K. Khafizov, C. Madrid-Aliste, S. C. Almo, and A. 
Fiser, Proc. Natl. Acad. Sci. U.S.A. 111, 3733 (2014).

[14] J. Nicola, Nature 505, 603 (2014). 

[15] Q. Chen, P. G. Vekilov, R. L. Nagel, and R. E. Hirsch, 
Biophys. J. 86, 1702 (2004). 

[16] T. C. Terwilliger, D. Stuart, and S. Yokoyama, Annu. 
Rev. Biophys. 38, 371 (2009). 

[17] The assembly of membrane proteins typically relies on 
different physical processes [165] . These processes are 
generally beyond the scope of this review, although we 
get back to this issue in the conclusion. 

[18] K. A. Dill and J. L. MacCallum, Science 338, 1042
(2012).

[19] S. Grzesiek and H.-J. Sass, Curr. Opin. Struct. Biol. 19, 
585 (2009). 

[20] R. Kerfah, M. J. Plevin, R. Sounier, P. Cans, and J. 
Boisbouvier, Curr. Opin. Struct. Biol. 32, 113 (2015). 

[21] A. Doerr, Nat. Meth. 11, 30 (2014). 

[22] X. Li, P. Mooney, S. Zheng, C. R. Booth, M. B. Braunfeld,
S. Gubbens, D. A. Agard, and Y. Cheng, Nat.
Meth. 10, 584 (2013). 

[23] X.-C. Bai, I. S. Fernandez, G. McMullan, and S. H.
Scheres, eLife 2, e00461 (2013).

[24] K. Yonekura, K. Kato, M. Ogasawara, M. Tomita, and 
C. Toyoshima, Proc. Natl. Acad. Sci. U.S.A. 112, 3368
(2015). 

[25] T. R. Barends et al., Nature 505, 244 (2014).

[26] H. P. Stevenson et al., Proc. Natl. Acad. Sci. U.S.A. 
111, 8470 (2014).

[27] M. P. Blakeley, P. Langan, N. Niimura, and A. Podjarny, 
Curr. Opin. Struct. Biol. 18, 593 (2008). 

[28] J. Janin, Prog. Biophys. Mol. Biol. 64, 145 (1995). 

[29] J. P. K. Doye, A. A. Louis, and M. Vendruscolo, Phys.
Biol. 1, P9 (2004). 

[30] J. P. K. Doye and W. C. K. Poon, Curr. Opin. Colloid 
Interface Sci. 11, 40 (2006). 

[31] C. A. Ross and M. A. Poirier, Nat. Med. 10, S10 (2004).

[32] S. Kmoch, J. Brynda, B. Asfaw, K. Bezouska, P. Novak, 
P. Rezacova, L. Ondrova, M. Filipec, J. Sedlacek, and 
M. Elleder, Hum. Mol. Gen. 9, 1779 (2000). 

[33] A. Pande, J. Pande, N. Asherie, A. Lomakin, O. Ogun, 
J. King, and G. B. Benedek, Proc. Natl. Acad. Sci. 
U.S.A. 98, 6116 (2001). 

[34] P. Evans, K. Wyatt, G. J. Wistow, O. A. Bateman, 
B. A. Wallace, and C. Slingsby, J. Mol. Biol. 343, 435 
(2004). 

[35] F. F. Li, S. Z. Wang, C. Gao, S. G. Liu, B. J. Zhao, M. 
Zhang, S. Z. Huang, S. Q. Zhu, and X. Ma, Mol. Vis. 
14, 378 (2008). 

[36] J. Newman et al., Acta Cryst. F 68, 253 (2012). 

[37] A. E. Bruno, A. M. Ruby, J. R. Luft, T. D. Grant, J. 
Seetharaman, G. T. Montelione, J. F. Hunt, and E. H. 
Snell, PLoS ONE 9, e100782 (2014).

[38] M. Caffrey, J. Struct. Biol. 142, 108 (2003). 

[39] H. Stevens, Life Out of Sequence: A Data-Driven History
of Bioinformatics (University of Chicago Press,
Chicago, 2013), p. 272. 

[40] Z. Derewenda, Acta Cryst. D 66, 604 (2010). 

[41] M. J. Anderson, C. L. Hansen, and S. R. Quake, Proc. 
Natl. Acad. Sci. U.S.A. 103, 16746 (2006). 

[42] E. Saridakis and N. E. Chayen, Trends Biotechnol. 27, 
99 (2009). 

[43] A. C. Dumetz, A. M. Chockla, E. W. Kaler, and A. M. 
Lenhoff, Cryst. Growth & Des. 9, 682 (2009). 

[44] J. N. Israelachvili, Intermolecular and surface forces 
(Academic Press, San Diego, 1991). 

[45] D. Chandler, Nature 437, 640 (2005). 

[46] J. D. Gunton, A. Shiryayev, and D. L. Pagan, Protein 
Condensation (Cambridge University Press, New York, 
2007). 










[47] S. Granick and S. C. Bae, Science 322, 1477 (2008). 

[48] B. J. Berne, J. D. Weeks, and R. Zhou, Annu. Rev.
Phys. Chem. 60, 85 (2009). 

[49] P. G. Vekilov and A. A. Chernov, Solid State Phys. 57,
1 (2002).

[50] Y. D. Devedjiev, Acta Cryst. F 71, 157 (2015). 

[51] B. W. Matthews, J. Mol. Biol. 33, 491 (1968). 

[52] M. Malfois, F. Bonnete, L. Belloni, and A. Tardieu, J. 
Chem. Phys. 105, 3290 (1996). 

[53] A. Lomakin, N. Asherie, and G. B. Benedek, J. Chem. 
Phys. 104, 1646 (1996). 

[54] M. Muschol and F. Rosenberger, J. Chem. Phys. 107, 
1953 (1997). 

[55] A. Stradner, G. M. Thurston, and P. Schurtenberger, J. 
Phys.: Condens. Matter 17, S2805 (2005). 

[56] J. J. McManus, A. Lomakin, O. Ogun, A. Pande, M. 
Basan, J. Pande, and G. B. Benedek, Proc. Natl. Acad. 
Sci. U.S.A. 104, 16856 (2007). 

[57] S. Talreja, S. L. Perry, S. Guha, V. Bhamidi, C. F. 
Zukoski, and P. J. A. Kenis, J. Phys. Chem. B 114, 
4432 (2010). 

[58] C. Gogelein, D. Wagner, F. Cardinaux, G. Nagele, and 
S. U. Egelhaaf, J. Chem. Phys. 136, 015102 (2012). 

[59] J.-P. Hansen and I. R. McDonald, Theory of simple liquids
(Academic Press, London, 2006).

[60] N. Asherie, Methods 34, 266 (2004). 

[61] N. Asherie, Protein Pept. Lett. 19, 708 (2012). 

[62] M. Sleutel, J. F. Lutsko, D. Maes, and A. E. S. 
Van Driessche, Phys. Rev. Lett. 114, 245501 (2015). 

[63] J. A. Thomson, P. Schurtenberger, G. M. Thurston, and 
G. B. Benedek, Proc. Natl. Acad. Sci. U.S.A. 84, 7079 
(1987). 

[64] A. C. Dumetz, A. M. Chockla, E. W. Kaler, and A. M.
Lenhoff, Biophys. J. 94, 570 (2008). 

[65] P. N. Pusey and W. van Megen, Nature 320, 340 (1986).

[66] S. Asakura and F. Oosawa, J. Chem. Phys. 22, 1255 
(1954). 

[67] A. Vrij, Pure & Appl. Chem. 48, 471 (1976). 

[68] A. P. Gast, C. K. Hall, and W. B. Russel, J. Colloid
Interface Sci. 96, 251 (1983). 

[69] H. N. W. Lekkerkerker, W. C. K. Poon, P. N. Pusey, A.
Stroobants, and P. B. Warren, Europhys. Lett. 20, 559
(1992). 

[70] M. H. J. Hagen and D. Frenkel, J. Chem. Phys. 101, 
4093 (1994). 

[71] N. Asherie, A. Lomakin, and G. B. Benedek, Phys. Rev. 
Lett. 77, 4832 (1996). 

[72] F. W. Tavares and S. I. Sandler, AIChE J. 43, 218
(1997). 

[73] M. A. Miller and D. Frenkel, Phys. Rev. Lett. 90, 
135702 (2003). 

[74] H. Liu, S. Garde, and S. Kumar, J. Chem. Phys. 123, 
174505 (2005). 

[75] D. L. Pagan and J. D. Gunton, J. Chem. Phys. 122, 
184515 (2005). 

[76] R. Lopez-Rendon, Y. Reyes, and P. Orea, J. Chem. 
Phys. 125, 084508 (2006). 

[77] J. Largo, M. A. Miller, and F. Sciortino, J. Chem. Phys. 
128, 134513 (2008). 

[78] A. Fortini, E. Sanz, and M. Dijkstra, Phys. Rev. E 78, 
041402 (2008). 

[79] M. G. Noro and D. Frenkel, J. Chem. Phys. 113, 2941
(2000).

[80] G. Foffi and F. Sciortino, Phys. Rev. E 74, 050401(R)
(2006).

[81] F. Platten, N. E. Valadez-Perez, R. Castaneda-Priego,
and S. U. Egelhaaf, J. Chem. Phys. 142, 174905 (2015). 

[82] P. R. ten Wolde and D. Frenkel, Science 277, 1975 
(1997). 

[83] G. Foffi, C. De Michele, F. Sciortino, and P. Tartaglia,
J. Chem. Phys. 122, 224903 (2005). 

[84] S. Manley, H. M. Wyss, K. Miyazaki, J. C. Conrad, V. 
Trappe, L. J. Kaufman, D. R. Reichman, and D. A. 
Weitz, Phys. Rev. Lett. 95, 238302 (2005). 

[85] S. Sastry, Nature 441, 671 (2006). 

[86] P. Charbonneau and D. R. Reichman, Phys. Rev. E 75, 
050401 (R) (2007). 

[87] S. Buzzaccaro, R. Rusconi, and R. Piazza, Phys. Rev.
Lett. 99, 098301 (2007). 

[88] P. J. Lu, E. Zaccarelli, F. Ciulla, A. B. Schofield, F. 
Sciortino, and D. A. Weitz, Nature 453, 499 (2008). 

[89] D. Rosenbaum, P. C. Zamora, and C. F. Zukoski, Phys.
Rev. Lett. 76, 150 (1996). 

[90] N. E. Chayen, Progr. Biophys. Mol. Biol. 88, 329 (2005). 

[91] A. George and W. W. Wilson, Acta Cryst. D 50, 361 
(1994). 

[92] C. Haas and J. Drenth, J. Phys. Chem. B 102, 4226 
(1998). 

[93] W. W. Wilson and L. J. DeLucas, Acta Cryst. F 70,
543 (2014). 

[94] J. Janin and F. Rodier, Proteins 23, 580 (1995). 

[95] O. Carugo and P. Argos, Protein Sci. 6, 2261 (1997).

[96] G. Pellicane, D. Costa, and C. Caccamo, J. Phys. Chem. 
B 108, 7538 (2004). 

[97] A. Stradner, G. Foffi, N. Dorsaz, G. Thurston, and P. 
Schurtenberger, Phys. Rev. Lett. 99, 198103 (2007). 

[98] N. Dorsaz, G. M. Thurston, A. Stradner, P. Schurtenberger,
and G. Foffi, J. Phys. Chem. B 113, 1693 (2009).

[99] G. Foffi, G. Savin, S. Bucciarelli, N. Dorsaz, G. M. 
Thurston, A. Stradner, and P. Schurtenberger, Proc.
Natl. Acad. Sci. U.S.A. 111, 16748 (2014).

[100] N. E. Chayen, E. Saridakis, and R. P. Sear, Proc. Natl. 
Acad. Sci. U.S.A. 103, 597 (2006). 

[101] A. J. Page and R. P. Sear, J. Am. Chem. Soc. 131, 
17550 (2009). 

[102] C. Haas, J. Drenth, and W. W. Wilson, J. Phys. Chem. 
B 103, 2808 (1999). 

[103] R. A. Curtis, H. W. Blanch, and J. M. Prausnitz, J. 
Phys. Chem. B 105, 2445 (2001). 

[104] A. Lomakin, N. Asherie, and G. B. Benedek, Proc. Natl. 
Acad. Sci. U.S.A. 96, 9465 (1999). 

[105] F. M. Richards, J. Mol. Biol. 82, 1 (1974). 

[106] H. Liu, S. K. Kumar, and F. Sciortino, J. Chem. Phys.
127, 084902 (2007). 

[107] E. Bianchi, R. Blaak, and C. N. Likos, Phys. Chem. 
Chem. Phys. 13, 6397 (2011). 

[108] S. James, M. K. Quinn, and J. J. McManus, Phys. 
Chem. Chem. Phys. 17, 5413 (2015). 

[109] M. Cieslik and Z. S. Derewenda, Acta Cryst. D 65, 500 
(2009). 

[110] W. N. Price et al., Nat. Biotechnol. 27, 51 (2009).

[111] R. P. Sear, J. Chem. Phys. 111, 4800 (1999).

[112] J. Chang, A. M. Lenhoff, and S. I. Sandler, J. Chem.
Phys. 120, 3003 (2004). 

[113] C. Gogelein, G. Nagele, R. Tuinier, T. Gibaud, A. Stradner,
and P. Schurtenberger, J. Chem. Phys. 129, 085102
(2008). 

[114] B. L. Neal, D. Asthagiri, and A. M. Lenhoff, Biophys.
J. 75, 2469 (1998).

[115] H. Liu, S. K. Kumar, and J. F. Douglas, Phys. Rev. 
Lett. 103, 018101 (2009). 

[116] S. Whitelam, Phys. Rev. Lett. 105, 088102 (2010). 

[117] T. K. Haxton and S. Whitelam, Soft Matter 8, 3558
(2012).

[118] E. Bianchi, J. Largo, P. Tartaglia, E. Zaccarelli, and F. 
Sciortino, Phys. Rev. Lett. 97, 168301 (2006). 

[119] G. Foffi and F. Sciortino, J. Phys. Chem. B 111, 9702
(2007). 

[120] M. S. Wertheim, J. Stat. Phys. 35, 19 (1984). 

[121] M. S. Wertheim, J. Stat. Phys. 35, 35 (1984). 

[122] E. Bianchi, P. Tartaglia, E. La Nave, and F. Sciortino, 
J. Phys. Chem. B 111, 11765 (2007). 

[123] E. Bianchi, P. Tartaglia, E. Zaccarelli, and F. Sciortino, 
J. Chem. Phys. 128, 144504 (2008). 

[124] E. Bianchi, R. Blaak, and C. N. Likos, Phys. Chem. 
Chem. Phys. 13, 6397 (2011). 

[125] D. Fusco, J. J. Headd, A. De Simone, J. Wang, and P. 
Charbonneau, Soft Matter 10, 290 (2014). 

[126] S. C. Glotzer and M. J. Solomon, Nat. Mater. 6, 557 
(2007). 

[127] Y. Wang, Y. Wang, D. R. Breed, V. N. Manoharan, L. 
Feng, A. D. Hollingsworth, M. Weck, and D. J. Pine,
Nature 491, 51 (2012). 

[128] P. Charbonneau and D. Frenkel, J. Chem. Phys. 126, 
196101 (2007). 

[129] F. Romano, E. Sanz, and F. Sciortino, J. Chem. Phys. 
132, 184501 (2010). 

[130] F. Romano, E. Sanz, and F. Sciortino, J. Chem. Phys. 
134, 174502 (2011). 

[131] R. Fantoni, D. Gazzillo, A. Giacometti, M. A. Miller, 
and G. Pastore, J. Chem. Phys. 127, 234507 (2007). 

[132] S. Granick, S. Jiang, and Q. Chen, Physics Today 2009, 
68 (2009). 

[133] G. Munao, Z. Preisler, T. Vissers, F. Smallenburg, and 
F. Sciortino, Soft Matter 9, 2652 (2013). 

[134] B. Ruzicka, E. Zaccarelli, L. Zulian, R. Angelini, M.
Sztucki, A. Moussaid, T. Narayanan, and F. Sciortino,
Nat. Mat. 10, 56 (2010). 

[135] E. Krissinel, J. Comput. Chem. 31, 133 (2010). 

[136] S. W. Wukovitz and T. O. Yeates, Nat. Struct. Biol. 2, 
1062 (1995). 

[137] S. Dasgupta, G. H. Iyer, S. H. Bryant, C. E. Lawrence, 
and J. A. Bell, Proteins: Structure, Function, and
Bioinformatics 28, 494 (1997). 

[138] M. Chruszcz, W. Potrzebowski, M. D. Zimmerman, M. 
Grabowski, H. Zheng, P. Lasota, and W. Minor, Prot. 
Sci. 17, 623 (2008). 

[139] D. Fusco and P. Charbonneau, J. Phys. Chem. B 118, 
8034 (2014). 

[140] M. Hloucha, J. F. M. Lodge, A. M. Lenhoff, and S. I.
Sandler, J. Cryst. Growth 232, 195 (2001). 

[141] N. M. Dixit and C. F. Zukoski, J. Chem. Phys. 117, 
8540 (2002). 

[142] N. Dorsaz, L. Filion, F. Smallenburg, and D. Frenkel, 
Farad. Disc. 159, 9 (2012). 

[143] D. Fusco and P. Charbonneau, Phys. Rev. E 88, 012721 
(2013). 

[144] N. W. Price II et al., Microb. Inform. Exp. 1, 1 (2011).

[145] G. Pellicane, G. Smith, and L. Sarkisov, Phys. Rev.
Lett. 101, 248102 (2008).

[146] A. Taudt, A. Arnold, and J. Pleiss, Phys. Rev. E 91, 
033311 (2015). 

[147] J. D. Schmit and K. Dill, J. Am. Chem. Soc. 134, 3934
(2012).

[148] Z. Liu, W.-P. Zhang, Q. Xing, X. Ren, M. Liu, and C. 
Tang, Angew. Chem. Intl. Ed. 51, 469 (2012).

[149] S. E. Acuner Ozbabacan, H. B. Engin, A. Gursoy, and
O. Keskin, Protein Eng. Des. Sel. 24, 635 (2011).

[150] C. J. Lanci, C. M. MacDermaid, S.-g. Kang, R. Acharya, 
B. North, X. Yang, X. J. Qin, W. F. DeGrado, and J. G.
Saven, Proc. Natl. Acad. Sci. U.S.A. 109, 7304 (2012). 

[151] M. V. King, B. S. Magdoff, M. B. Adelman, and D.
Harker, Acta Cryst. 9, 460 (1956).

[152] M. V. King, J. Bello, E. H. Pignataro, and D. Harker,
Acta Cryst. 15, 144 (1962). 

[153] M. Dixon, H. Nicholson, L. Shewchuk, W. Baase, and 
B. Matthews, J. Mol. Biol. 227, 917 (1992). 

[154] H. Faber and B. Matthews, Nature 348, 263 (1990). 

[155] D. E. McRee, S. M. Bedford, T. E. Meyer, and M. A. 
Cusanovich, J. Biol. Chem. 265, 5364 (1990). 

[156] A. V. Elgersma, M. Ataka, and T. Katsura, J. Cryst. 
Growth 122, 31 (1992). 

[157] P. E. Wright and H. J. Dyson, Nature Rev. Mol. Cell. 
Biol. 16, 18 (2015). 

[158] M. Budayova-Spano, F. Bonnet, J.-P. Astier, and S. 
Veesler, J. Cryst. Growth 235, 547 (2002). 

[159] M. Budayova-Spano, S. Lafont, J.-P. Astier, C. Ebel, 
and S. Veesler, J. Cryst. Growth 217, 311 (2000). 

[160] S. Veesler, N. Fert, M.-S. Costes, M. Czjzek, and J.-P. 
Astier, Cryst. Growth & Design 4, 1137 (2004). 

[161] N. Asherie, J. Pande, A. Pande, J. A. Zarutskie, J. Lomakin,
A. Lomakin, O. Ogun, L. J. Stern, J. King, and
G. B. Benedek, J. Mol. Biol. 314, 663 (2001). 

[162] A. Shiryayev, D. L. Pagan, J. D. Gunton, D. S. Rhen, A. 
Saxena, and T. Lookman, J. Chem. Phys. 122, 234911 
(2005). 

[163] N. Wentzel and J. D. Gunton, J. Phys. Chem. B 112, 
7803 (2008). 

[164] E. P. Carpenter, K. Beis, A. D. Cameron, and S. Iwata, 
Curr. Opin. Struct. Biol. 18, 581 (2008). 

[165] M. Caffrey, Annu. Rev. Biophys. 38, 29 (2009). 

[166] A. Bhattacharya, Nature 459, 24 (2009). 

[167] M. F. Hagan, Adv. Chem. Phys. 155, 1 (2014). 

[168] N. S. Bieler, T. P. J. Knowles, D. Frenkel, and R. Vacha, 
PLoS Comput. Biol. 8, e1002692 (2012).

[169] A. Saric, Y. C. Chebaro, T. P. J. Knowles, and D. 
Frenkel, Proc. Natl. Acad. Sci. U.S.A. 111, 17869
(2014). 

[170] L. E. Kay, Who Wrote the Book of Life?: A History of 
the Genetic Code (Stanford University Press, Stanford, 
2000).

[171] M. E. Johnson and G. Hummer, Proc. Natl. Acad. Sci. 
U.S.A. 108, 603 (2011). 

[172] K. D. Wilkinson, in Protein-Protein Interactions: Methods
and Protocols (Humana Press, New York, 2004),
Vol. 261, pp. 15-31. 

[173] T. Zhuang, B. K. Jap, and C. R. Sanders, J. Am. Chem.
Soc. 133, 20571 (2011). 



